<110> APPLICANT: Lukyanov, Sergey
Fradkov, Arcady
Labas, Yulii
Matz, Mikhail
Lukyanov, Konstantin
Gurskaya, Nadezda

<120> TITLE OF INVENTION: FAR RED SHIFTED FLUORESCENT PROTEINS

<130> FILE REFERENCE: CLON-028WO

<140> CURRENT APPLICATION NUMBER: US/09/976,673
<141> CURRENT FILING DATE: 2001-10-12

<150> PRIOR APPLICATION NUMBER: 60/240,018
<151> PRIOR FILING DATE: 2000-10-12

<150> PRIOR APPLICATION NUMBER: 60/306,131
<151> PRIOR FILING DATE: 2001-07-16

<160> NUMBER OF SEQ ID NOS: 26

<170> SOFTWARE: FastSEQ for Windows Version 4.0

<210> SEQ ID NO: 1
<211> LENGTH: 910
<212> TYPE: DNA
<213> ORGANISM: heteractis crispa

<400> SEQUENCE: 1
accatttgct ttggttcctt ggcaaacgaa agtttagaac gaaaactgac ccaaattaca 60
tcttcctcct ggatccttac catggctggt ttgttgaaag aaagtatgcg catcaagatg 120
tacatggaag gcacggttaa tggccattat ttcaagtgtg aaggagaggg agacggcaac 180
ccatttacag gtacgcagag catgaggatt catgtcaccg aaggggctcc attaccattt 240
gccttcgaca ttttggcacc gtgttgtgag tacggcagca ggacctttgt ccaccatacg 300
gcagagattc ccgatttctt caagcagtct ttccctgaag gctttacttg ggaaagaacc 360
acaacctatg aagatggagg cattcttact gctcatcagg acacaagcct ggaggggaac 420
tgccttatat acaaggtgaa agtccttggt accaattttc ctgctgatgg ccccgtgatg 480
aagaacaaat caggaggatg ggagccatgc actgaggtgg tttatccaga gaatggtgtc 540
ctgtgtggac gtaatgtgat ggcccttaaa gtcggtgatc gtcgtttgat ctgccatctc 600
tatacttctt acaggtccaa gaaagcagtc cgtgccttga caatgccagg atttcatttt 660
acagacatcc gccttcagat gccgaggaaa acgaaagacg agtactttga actgtacgaa 720
gcatctgtgg ctaggtacag tgatcttcct gaaaaagcaa attgattgtt cccagtgaca 780
ccagactgct gtcagctttt ggttaaagcc cgaaagacaa aaggacattt gtagtttagt 840
ttatatttcc ctttcatttg tgaatcaaca ttgtactctc tgtaaacctt taaaatgctc 900
cattaaacct 910

<210> SEQ ID NO: 2
<211> LENGTH: 227
<212> TYPE: PRT
<213> ORGANISM: heteractis crispa

<400> SEQUENCE: 2
Met Ala Gly Leu Leu Lys Glu Ser Met Arg Ile Lys Met Tyr Met Glu
1               5                   10                  15
Gly Thr Val Asn Gly His Tyr Phe Lys Cys Glu Gly Glu Gly Asp Gly
            20                  25                  30
Asn Pro Phe Thr Gly Thr Gln Ser Met Arg Ile His Val Thr Glu Gly
        35                  40                  45
Ala Pro Leu Pro Phe Ala Phe Asp Ile Leu Ala Pro Cys Cys Glu Tyr
    50                  55                  60
Gly Ser Arg Thr Phe Val His His Thr Ala Glu Ile Pro Asp Phe Phe
65                  70                  75                  80
Lys Gln Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr Thr Tyr
                85                  90                  95
Glu Asp Gly Gly Ile Leu Thr Ala His Gln Asp Thr Ser Leu Glu Gly
            100                 105                 110
Asn Cys Leu Ile Tyr Lys Val Lys Val Leu Gly Thr Asn Phe Pro Ala
        115                 120                 125
Asp Gly Pro Val Met Lys Asn Lys Ser Gly Gly Trp Glu Pro Cys Thr
    130                 135                 140
Glu Val Val Tyr Pro Glu Asn Gly Val Leu Cys Gly Arg Asn Val Met
145                 150                 155                 160
Ala Leu Lys Val Gly Asp Arg Arg Leu Ile Cys His Leu Tyr Thr Ser
                165                 170                 175
Tyr Arg Ser Lys Lys Ala Val Arg Ala Leu Thr Met Pro Gly Phe His
            180                 185                 190
Phe Thr Asp Ile Arg Leu Gln Met Pro Arg Lys Thr Lys Asp Glu Tyr
        195                 200                 205
Phe Glu Leu Tyr Glu Ala Ser Val Ala Arg Tyr Ser Asp Leu Pro Glu
    210                 215                 220
Lys Ala Asn
225

<210> SEQ ID NO: 3
<211> LENGTH: 908
<212> TYPE: DNA
<213> ORGANISM: heteractis crispa

<400> SEQUENCE: 3
accatttgct ttggttcctt ggcaaacgaa agtttagacg aaaactgacc caaattacat 60
cctcctgatc cttaccatgg ctggtttgtt gaaagaaagt atgcgcatca agatgtacat 120
ggaaggcacg gttaatggcc attatttcaa gtgtgaagga gagggagacg gcaacccatt 180
tacaggtacg cagagcatga ggattcatgt caccgaaggg gctccattac catttgcctt 240
cgacattttg gcaccgtgtt gtgagtacgg cagcaggacc tttgtccacc atacggcaga 300
gattcccgat ttcttcaagc agtctttccc tgaaggcttt acttgggaaa gaaccacaac 360
ctatgaagat ggaggcattc ttactgctca tcaggacaca agcctggagg ggaactgcct 420
tatatacaag gtgaaagtcc ttggtaccaa ttttcctgct gatggccccg tgatgaagaa 480
caaatcagaa ggatgggagc catgcactga ggtggtttat ccagataatg gtgtcctgtg 540
tggacgtaat gtgatggccc ttaaagtcgg tgatcgtcgt ttgatctgcc atctctatac 600
ttcttacagg tccaagaaag cagtccgtgc cttgacaatg ccaggatttc attttacaga 660
catccgcctt cagatgccga ggaaaacgaa agacgagtac tttgaactgt acgaagcatc 720
tgtggctagg tacagtgatc ttcctgaaaa agcaaattga ttgttcccag tgacaccaga 780
ctgctgtcag cttttggtta aagcccgaaa gacaaaagga catttgtagt tttagtttat 840
attttccctt tcattttgtg aatcaacatt gtactctctg taaaccttta aaatgctcca 900
ttaaacct 908

<210> SEQ ID NO: 4
<211> LENGTH: 227
<212> TYPE: PRT
<213> ORGANISM: heteractis crispa

<400> SEQUENCE: 4
Met Ala Gly Leu Leu Lys Glu Ser Met Arg Ile Lys Met Tyr Met Glu
1               5                   10                  15
Gly Thr Val Asn Gly His Tyr Phe Lys Cys Glu Gly Glu Gly Asp Gly
            20                  25                  30
Asn Pro Phe Thr Gly Thr Gln Ser Met Arg Ile His Val Thr Glu Gly
        35                  40                  45
Ala Pro Leu Pro Phe Ala Phe Asp Ile Leu Ala Pro Cys Cys Glu Tyr
    50                  55                  60
Gly Ser Arg Thr Phe Val His His Thr Ala Glu Ile Pro Asp Phe Phe
65                  70                  75                  80
Lys Gln Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr Thr Tyr
                85                  90                  95
Glu Asp Gly Gly Ile Leu Thr Ala His Gln Asp Thr Ser Leu Glu Gly
            100                 105                 110
Asn Cys Leu Ile Tyr Lys Val Lys Val Leu Gly Thr Asn Phe Pro Ala
        115                 120                 125
Asp Gly Pro Val Met Lys Asn Lys Ser Glu Gly Trp Glu Pro Cys Thr
    130                 135                 140
Glu Val Val Tyr Pro Asp Asn Gly Val Leu Cys Gly Arg Asn Val Met
145                 150                 155                 160
Ala Leu Lys Val Gly Asp Arg Arg Leu Ile Cys His Leu Tyr Thr Ser
                165                 170                 175
Tyr Arg Ser Lys Lys Ala Val Arg Ala Leu Thr Met Pro Gly Phe His
            180                 185                 190
Phe Thr Asp Ile Arg Leu Gln Met Pro Arg Lys Thr Lys Asp Glu Tyr
        195                 200                 205
Phe Glu Leu Tyr Glu Ala Ser Val Ala Arg Tyr Ser Asp Leu Pro Glu
    210                 215                 220
Lys Ala Asn
225

<210> SEQ ID NO: 5
<211> LENGTH: 684
<212> TYPE: DNA
<213> ORGANISM: heteractis crispa

<400> SEQUENCE: 5
atggctggtt tgttgaaaga aagtatgcgc atcaagatgt acatggaagg cacggttaat 60
ggccattatt tcaagtgtga aggagaggga gacggcaacc catttacagg tacgcagagc 120
atgaggattc atgtcaccga aggggctcca ttaccatttg ccttcgacat tttggcaccg 180
tgttgtgagt acggcagcag gacctttgtc caccatacgg cagagattcc cgatttcttc 240
aagcagtctt tccctgaagg ctttacttgg gaaagaacca caacctatga agatggaggc 300
attcttactg ctcatcagga cacaagcctg gaggggaact gccttatata caaggtgaaa 360
gtccttggta ccaattttcc tgctgatggc cccgtgatga agaacaaatc aggaggatgg 420
gagccaagca ctgaggtggt ttatccagag aatggtgtcc tgtgtggacg taatgtgatg 480
gcccttaaag tcggtgatcg tcgtttgatc tgccatctct atacttctta caggtccaag 540
aaagcagtcc gtgccttgac aatgccagga tttcatttta cagacatccg ccttcagatg 600
ccgaggaaaa cgaaagacga gtactttgaa ctgtacgaag catctgtggc taggtacagt 660
gatcttcctg aaaaagcaaa ttga 684

<210> SEQ ID NO: 6
<211> LENGTH: 227
<212> TYPE: PRT
<213> ORGANISM: heteractis crispa

<400> SEQUENCE: 6
Met Ala Gly Leu Leu Lys Glu Ser Met Arg Ile Lys Met Tyr Met Glu
1               5                   10                  15
Gly Thr Val Asn Gly His Tyr Phe Lys Cys Glu Gly Glu Gly Asp Gly
            20                  25                  30
Asn Pro Phe Thr Gly Thr Gln Ser Met Arg Ile His Val Thr Glu Gly
        35                  40                  45
Ala Pro Leu Pro Phe Ala Phe Asp Ile Leu Ala Pro Cys Cys Glu Tyr
    50                  55                  60
Gly Ser Arg Thr Phe Val His His Thr Ala Glu Ile Pro Asp Phe Phe
65                  70                  75                  80
Lys Gln Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr Thr Tyr
                85                  90                  95
Glu Asp Gly Gly Ile Leu Thr Ala His Gln Asp Thr Ser Leu Glu Gly
            100                 105                 110
Asn Cys Leu Ile Tyr Lys Val Lys Val Leu Gly Thr Asn Phe Pro Ala
        115                 120                 125
Asp Gly Pro Val Met Lys Asn Lys Ser Gly Gly Trp Glu Pro Ser Thr
    130                 135                 140
Glu Val Val Tyr Pro Glu Asn Gly Val Leu Cys Gly Arg Asn Val Met
145                 150                 155                 160
Ala Leu Lys Val Gly Asp Arg Arg Leu Ile Cys His Leu Tyr Thr Ser
                165                 170                 175
Tyr Arg Ser Lys Lys Ala Val Arg Ala Leu Thr Met Pro Gly Phe His
            180                 185                 190
Phe Thr Asp Ile Arg Leu Gln Met Pro Arg Lys Thr Lys Asp Glu Tyr
        195                 200                 205
Phe Glu Leu Tyr Glu Ala Ser Val Ala Arg Tyr Ser Asp Leu Pro Glu
    210                 215                 220
Lys Ala Asn
225

<210> SEQ ID NO: 7
<211> LENGTH: 681
<212> TYPE: DNA
<213> ORGANISM: heteractis crispa

<400> SEQUENCE: 7
tctggtttgt tgaaagaaag tatgcgcatc aagatgtaca tggaaggcac ggttaatggc 60
cattatttca agtgtgaagg agagggagac ggcaacccat ttgcaggtac gcagagcatg 120
aggattcatg tcaccgaagg ggctccatta ccatttgcct tcgacatttt ggcaccgtgt 180
tgtgagtacg gcagcaggac ctttgtccac catacggcag agattcccga tttcttcaag 240
cagtctttcc ctgaaggctt tacttgggaa agaaccacaa cctatgaaga tggaggcatt 300
cttactgctc atcaggacac aagcctggag gggaactgcc ttatatacaa ggtgaaagtc 360
cttggtacca attttcctgc tgatggcccc gtgatgaaga acaaatcagg aggatgggag 420
ccaagcactg aggtggttta tccagagaat ggtgtcctgt gtggacgtaa tgtgatggcc 480
cttaaagtcg gtgatcgtcg tttgatctgc catcactata cttcttacag gtccaagaaa 540
gcagtccgtg ccttgacaat gccaggattt cattttacag acatccgcct tcagatgctg 600
aggaaagaga aagacgagta ctttgaactg tacgaagcat ctgtggctag gtacagtgat 660
cttcctgaaa aagcaaattg a 681

<210> SEQ ID NO: 8
<211> LENGTH: 226
<212> TYPE: PRT
<213> ORGANISM: heteractis crispa

<400> SEQUENCE: 8
Ser Gly Leu Leu Lys Glu Ser Met Arg Ile Lys Met Tyr Met Glu Gly
1               5                   10                  15
Thr Val Asn Gly His Tyr Phe Lys Cys Glu Gly Glu Gly Asp Gly Asn
            20                  25                  30
Pro Phe Ala Gly Thr Gln Ser Met Arg Ile His Val Thr Glu Gly Ala
        35                  40                  45
Pro Leu Pro Phe Ala Phe Asp Ile Leu Ala Pro Cys Cys Glu Tyr Gly
    50                  55                  60
Ser Arg Thr Phe Val His His Thr Ala Glu Ile Pro Asp Phe Phe Lys
65                  70                  75                  80
Gln Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr Thr Tyr Glu
                85                  90                  95
Asp Gly Gly Ile Leu Thr Ala His Gln Asp Thr Ser Leu Glu Gly Asn
            100                 105                 110
Cys Leu Ile Tyr Lys Val Lys Val Leu Gly Thr Asn Phe Pro Ala Asp
        115                 120                 125
Gly Pro Val Met Lys Asn Lys Ser Gly Gly Trp Glu Pro Ser Thr Glu
    130                 135                 140
Val Val Tyr Pro Glu Asn Gly Val Leu Cys Gly Arg Asn Val Met Ala
145                 150                 155                 160
Leu Lys Val Gly Asp Arg Arg Leu Ile Cys His His Tyr Thr Ser Tyr
                165                 170                 175
Arg Ser Lys Lys Ala Val Arg Ala Leu Thr Met Pro Gly Phe His Phe
            180                 185                 190
Thr Asp Ile Arg Leu Gln Met Leu Arg Lys Glu Lys Asp Glu Tyr Phe
        195                 200                 205
Glu Leu Tyr Glu Ala Ser Val Ala Arg Tyr Ser Asp Leu Pro Glu Lys
    210                 215                 220
Ala Asn
225

<210> SEQ ID NO: 9
<211> LENGTH: 681
<212> TYPE: DNA
<213> ORGANISM: heteractis crispa

<400> SEQUENCE: 9
tctggtttgt tgaaagaaag tatgcgcatc aagatgtaca tggaaggcac ggttaatggc 60
cattatttca agtgtgaagg agagggagac ggcaacccat ttgcaggtac gcagagcatg 120
aggattcatg tcaccgaagg ggctccatta ccatttgcct tcgacatttt ggcaccgtgt 180
tgtgcgtacg gcagcaggac ctttgtccac catacggcag agattcccga tttcttcaag 240
cagtctttcc ctgaaggctt tacttgggaa agaaccacaa cctatgaaga tggaggcatt 300
cttactgctc atcaggacac aagcctggag gggaactgcc ttatatacaa ggtgaaagtc 360